Maintenance of Discovered Knowledge: A Case in Multi-Level Association Rules

Authors

  • David Wai-Lok Cheung
  • Vincent T. Y. Ng
  • Benjamin W. Tam
Abstract

Many knowledge discovery (KDD) systems must spend a substantial amount of effort searching for rules and patterns within large amounts of data. As their databases evolve through updates, these systems must update their previously discovered knowledge to reflect the current state of the databases. The straightforward approach of re-running the discovery process on the whole updated database to rediscover the rules and patterns is not cost-effective in general, and is unacceptable in many cases. We have studied the problem of updating discovered association rules and found that it is nontrivial, because updates may not only invalidate some existing strong association rules but also turn some weak rules into strong ones. An incremental technique and a fast algorithm, FUP, have been proposed previously for updating discovered single-level association rules. In this study, we propose a more efficient algorithm, FUP*, which generates fewer candidate sets than FUP. In addition, we demonstrate that the incremental technique in FUP and FUP* can be generalized to some other KDD systems. For this purpose, we propose an efficient algorithm, MLUp, for updating discovered multi-level association rules. Our performance study shows that MLUp outperforms representative mining algorithms such as ML-T2 in updating discovered multi-level association rules.
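The incremental idea described in the abstract can be illustrated with a small sketch. This is not the authors' FUP, FUP*, or MLUp algorithm. It is a simplified, single-pass illustration for itemsets of one fixed size, and every function and variable name in it (`count_itemsets`, `frequent_itemsets`, `update_frequent`, `old_db`, `increment`) is hypothetical. It relies on one observation: an itemset that was infrequent in the old database and is also infrequent within the newly added transactions cannot be frequent in the combined database, so only a few candidates need a rescan of the old data.

```python
from collections import Counter
from itertools import combinations


def count_itemsets(transactions, size):
    """Count how many transactions contain each itemset of the given size."""
    counts = Counter()
    for t in transactions:
        for itemset in combinations(sorted(set(t)), size):
            counts[itemset] += 1
    return counts


def frequent_itemsets(transactions, min_support, size=1):
    """Baseline: mine frequent itemsets from scratch (full scan)."""
    threshold = min_support * len(transactions)
    counts = count_itemsets(transactions, size)
    return {s: c for s, c in counts.items() if c >= threshold}


def update_frequent(old_frequent, old_db, increment, min_support, size=1):
    """Illustrative incremental update (an assumption-laden sketch, not FUP).

    old_frequent maps itemsets that were frequent in old_db to their counts.
    Returns the itemsets frequent in old_db + increment with updated counts.
    """
    threshold = min_support * (len(old_db) + len(increment))
    inc_counts = count_itemsets(increment, size)

    updated = {}
    # Previously frequent itemsets: only the increment has to be scanned.
    # Some of them may fall below the threshold (a strong rule is invalidated).
    for itemset, old_count in old_frequent.items():
        new_count = old_count + inc_counts.get(itemset, 0)
        if new_count >= threshold:
            updated[itemset] = new_count

    # Previously infrequent itemsets: they can become frequent only if they
    # are frequent within the increment, so everything else is pruned.
    inc_threshold = min_support * len(increment)
    candidates = [
        s for s, c in inc_counts.items()
        if s not in old_frequent and c >= inc_threshold
    ]

    # Only the surviving candidates require a scan of the old database.
    for itemset in candidates:
        old_count = sum(1 for t in old_db if set(itemset) <= set(t))
        new_count = old_count + inc_counts[itemset]
        if new_count >= threshold:
            updated[itemset] = new_count
    return updated


old_db = [["a", "b"], ["a", "c"], ["a", "b"], ["b", "c"]]
increment = [["c", "d"], ["c"]]
min_support = 0.6

old_frequent = frequent_itemsets(old_db, min_support)
result = update_frequent(old_frequent, old_db, increment, min_support)
print(result)
```

In this toy data, the items `a` and `b` are frequent before the update and drop out afterwards, while `c` was infrequent and becomes frequent, which mirrors the two effects the abstract describes. The incremental result matches re-mining the combined database from scratch.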

Similar resources

Analyzing Multi-level Spatial Association Rules Through a Graph-Based Visualization

Association rule discovery is a fundamental task in spatial data mining, where data are naturally described at multiple levels of granularity. ARES is a spatial data mining system that takes advantage of this taxonomic knowledge on spatial data to mine multi-level spatial association rules. A large number of rules is typically discovered even from a small set of spatial data. In this paper we p...

Fuzzy Apriori Rule Extraction Using Multi-Objective Particle Swarm Optimization: The Case of Credit Scoring

Many methods have been introduced to solve the credit scoring problem, such as support vector machines, neural networks and rule-based classifiers. Rule bases are favoured in credit decision making because of their ability to explicitly distinguish between good and bad applicants. In this paper multi-objective particle swarm is applied to optimize fuzzy apriori rule base in credit scoring. ...

A new approach based on data envelopment analysis with double frontiers for ranking the discovered rules from data mining

Data envelopment analysis (DEA) is a relatively new data-oriented approach to evaluating the performance of a set of peer entities called decision-making units (DMUs) that convert multiple inputs into multiple outputs. Within a relatively short period, DEA has become a strong quantitative and analytical tool for measuring and evaluating performance. In an article written by Toloo et al. (2009...

Multi-Level Mining and Visualization of Informative Association Rules

Recently, there has been an increasing interest in applying association rule mining on data warehouses to identify trends and patterns that exist in the historical data present in large data warehouses. These warehouses have a complex underlying multidimensional structure and the application of traditional rule mining algorithms becomes hard. In this paper, we review and critically evaluate the...

Publication date: 1996